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Abstract 

In the Matroid Secretary Problem, introduced by Babaioff et al. [SODA 2007], the elements 

of a given matroid are presented to an onhne algorithm in random order. When an element 

is revealed, the algorithm learns its weight and decides whether or not to select it under the 

^ I restriction that the selected elements form an independent set in the matroid. The objective 

is to maximize the total weight of the chosen elements. In the most studied version of this 

CO I problem, the algorithm has no information about the weights beforehand. We refer to this as 

the zero information model. In this paper we study a different model, also proposed by Babaioff 

et al., in which the relative order of the weights is random in the matroid. To be precise, in 

^Q ■ the random assignment model, an adversary selects a collection of weights that are randomly 

^\ I assigned to the elements of the matroid. Later, the elements are revealed to the algorithm in a 

• . random order independent of the assignment. 

fi I Our main result is the first constant competitive algorithm for the matroid secretary problem 

in the random assignment model. This solves an open question of Babaioff et al. Our algorithm 

achieves a competitive ratio of 2e^/(e — 1). It exploits the notion of principal partition of a 

^ . matroid, its decomposition into uniformly dense minors, and a 2e-competitive algorithm for 

^S| ' uniformly dense matroids we also develop. As additional results, we present simple constant 

VO ■ competitive algorithms in the zero information model for various classes of matroids including 

^^ ' cographic, low density and the case when every element is in a small cocircuit. In the same 

. . model, we also give a fce-competitive algorithm for fc-column sparse linear matroids, and a new 

P^ ' 0(logr)-competitive algorithm for general matroids of rank r which only uses the relative order 
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of the weights seen and not their numerical value, as previously needed. 
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1 Introduction 

In the simplest form of the secretary problem, an employer wants to select the best secretary among 
n candidates arriving in random order. Once a secretary is interviewed, the employer must decide 
immediately whether to accept the candidate or not and that decision is final. Lindley [18] and 
Dynkin [10] have shown that sampling the first [n/e\ candidates and then selecting the first one 
whose value is higher than all the sampled ones gives a probability of at least 1/e of selecting the 
best secretary and that no algorithm can beat this constant. An important generalization of this 
problem with many applications is known as the multiple choice secretary problem, (see |15])- In 
this problem we wish to select a set of at most k secretaries from a pool of n applicants and the 
objective is to select a group of combined value as high as possible. 

Babaioff et al. [4j introduce the generalized secretary problem, as a natural class of extensions 
of the previous problem in which the set returned by the algorithm must obey some combinatorial 
restriction. In this setting, a finite set E with hidden nonnegative weights and a collection of 
subsets Z C 2^ closed under inclusion are given. The collection I describes the sets of elements 
that can be simultaneously accepted (these are the feasible sets or the domain of the problem). The 
elements of E are presented to an online algorithm in random order. When an element is revealed, 
the algorithm learns its weight and decides whether or not to accept it under the restriction that 
the set of accepted elements is feasible. This decision is irreversible and it must be taken before 
the next element is revealed. The objective is to output a feasible set of maximum total weight. 

We remark that other lines of generalizations of the multiple choice secretary problem having 
different objective functions have also been considered. These generalizations include, among oth- 
ers, minimizing the sum of the relative ranks of the selected elements (studied by Ajtai et al. [1]), 
the weighted and time discounted secretary problems of Babaioff et al. |2j, the J-choice i^-best 
secretary problem studied by Buchbinder et al. ^ and the submodular secretary problem of Bateni 
et al. [6]. 

The generalized secretary problem is of interest due to its connection to online auctions. In both 
the original and multiple choice secretary problems, we can regard the algorithm as an auctioneer 
having one or many identical items, and the secretaries as agents arriving at random times, each 
one having a different valuation for the item. The goal of the algorithm is to assign the items to 
the agents as they arrive while maximizing the total social welfare. In more complex situations, 
the algorithm may be considered to have access to a collection of goods that it wishes to assign to 
agents, subject to some restrictions. In many cases, these restrictions can be modeled by matroid 
constraints. For that reason, the matroid secretary problem, in which the feasible sets are the 
independent sets of a matroid is of special interest (see e.g. a survey of Babaioff et al. [4j). 

Notice that the difficulty of the problem changes depending on the information we know be- 
forehand about the weights. We recognize four different models in increasing order of difficulty. 

• Full information model: The weights are chosen i.i.d. from a known distribution. 

• Partial information model: The weights are chosen i.i.d. from an unknown distribution. 

• Random assignment model: An adversary chooses a list of nonnegative weights, which 
are then assigned to the elements using a uniform random one-to-one correspondence, which 
is independent of the random order at which the elements are revealed. 

• Zero information model: An adversary assigns the weights arbitrarily. 

The difficulty may also change depending on whether the algorithm learns the actual weight of 
the elements or just the relative order of the weights seen so far. See the surveys of Freeman |13] 
and Ferguson [12J for references and variations of these models in the classical secretary problem. 



Note that for both the classical problem and the multiple choice problem, the random assignment 
and the zero information models coincide. 

There has been a significant amount of work on the matroid secretary problem under zero 
information. Constant competitive algorithms are known for partition matroids (this corresponds 
to the classical [181 ID] and multiple choice secretary problem [151 E]) a^id also for graphic and 
transversal matroids [51I91I161I2]. It is also known [5] that if a matroid admits a constant competitive 
algorithm under zero information, then so do its restrictions and truncations. For general matroids, 
the best algorithm known so far, due to Babaioff et al. [3, is 0(logr)-competitive, where r is the 
rank of the matroid. 

Non-matroidal domains have also been considered in the literature. Babaioff et al. [3J show 
a lOe-competitive algorithms for knapsack domains even in the case where both the weights and 
lengths are revealed online. Korula and Pal |16j give constant competitive algorithms for some cases 
of intersection of partition matroids under zero information, specifically for matchings in graphs 
and hypergraphs where the edges have constant size. 

Not every domain admits constant competitive algorithms. Babaioff et al. (5] have shown a 
particular domain for which no algorithm has a competitive ratio smaller than o(log n/ log log n) 
even in the full information model. However, matroid domains have the following special prop- 
erty: If we are allowed to reject elements which have been previously accepted, while keeping at 
every moment an independent set, then it is possible to output the optimum independent set no 
matter in which order the elements are presented. This intuition motivated Babaioff et al. [5l H] 
to conjecture that the matroid secretary problem admits a constant competitive algorithm even 
under zero information. According to these authors, this question is also non-trivial for the random 
assignment model, and for the model in which the order the elements are presented is adversarial, 
but the weights are randomly assigned from a pool of hidden values. 

Main Result In this paper, we answer the last question affirmatively for the random assignment 
model, exhibiting a 2e^/(e — l)-competitive algorithm for any matroid domain. We remark here 
that our results also apply to the partial and full information settings for which, as far as we know, 
no previous results existed. Our algorithm uses as building block an algorithm for uniformly dense 
matroids we also develop. 

Uniformly dense matroids are matroids for which the density of a set, that is, the ratio of its 
cardinality to its rank, is at most the density of the entire ground set. The simplest examples 
of these are precisely the uniform matroids. Uniformly dense matroids and uniform matroids of 
the same rank over the same ground set behave similarly, in the sense that the distribution of the 
rank of a random set is similar for both matroids. We use this fact to devise a 2e-competitive 
algorithm for these matroids in the random assignment model. In order to extend this algorithm 
to general matroids we exploit some notions coming from the theory of principal partitions of a 
matroid, particularly its principal sequence. Roughly speaking, the principal sequence of a matroid 
A^ is a decomposition of its ground set into a sequence of parts, each of which is the underlying 
set of a uniformly dense minor of A4 . By employing independently the previous algorithm in each 
of these minors, we obtain an algorithm that returns an independent set of M, while loosing only 
an extra factor of 1 — 1/e on its competitive ratio. 

Additional Results We also develop simple constant competitive algorithms under zero infor- 
mation for various classes of matroids including cographic, low density and the case when every 



element is in a small cocircuit. Also, we show a fce-competitive algorithm for the case when the 
matroid is representable by a matrix in which each column has at most k non-zero entries. This 
result generalizes the 2e-competitive algorithm for graphic matroids of Korula and Pal [16j . Finally, 
we give a new 0(logr)-competitive algorithm for general matroids. Unlike the previous algorithm 
of Babaioff et al. [5], our algorithm does not use the numerical value of the weights. It only needs 
the ability to make comparisons among seen elements. 

2 Matroid Secretary Problem in the Random Assignment Model 

In this paper, we assume familiarity with basic concepts in matroid theory. For an introduction 
and specific results, we refer to Oxley's book [22j. 

Consider a matroid M = {E,I) with ground set E = {ei, . . . ,6^}. An adversary selects a set 
W of n nonnegative weights wi > ■ ■ ■ > Wn > 0, which are assigned to the elements of the matroid 
via a random permutation o" : [n] — > [n], i.e., the weight function of the elements w : E ^ W is such 
that w{efju\) = Wi. The elements are then presented to an online algorithm via a random order 
TT : [n] — >■ [n]. When an element is presented, the algorithm must decide whether to add it or not to 
the solution set under the condition that this set is independent in M at all times. The objective 
is to output a solution set ALG whose value i(;(ALG) = YlieeAhG ^(^) ^^ ^^ high as possible. 

We further assume that when the z-th element of the stream, e^(j), is presented, the algorithm 
only learns the relative order of the weight with respect to the previously seen ones. This is, it can 
compare tt)(e^(j)) with t«(e^(fc)) for all j, k < i, but it can not use the actual numerical values of the 
weights. Without loss of generality, we can assume that there are no ties in W, because otherwise 
we can break them using a new random permutation r; if the comparison between two elements 
seen gives a tie, then we consider heavier the one having larger r-value. 

For a given permutation a, let OPT^vt (cr) be the the lexicographic first base of Ai according 
to the permutation a. In other words, 0FT_\4{a) is the set obtained by applying the greedy 
procedure that selects an element if it can be added to the previously selected ones while preserving 
independence in A^, on the sequence e^(^i),efj(^2)i ■ ■ ■ i^a{n)- Standard matroid arguments imply 
that OPT;v( (cr) is a maximum independent set with respect to any weight function v for which 
■^(6o-(i)) > ■ ■ ■ ^ "f^i^ain)) ^ 0- III particular, this is true for the weight function w defined before. 
We will drop the subindex Al in 0PT;v((c7) whenever there is no possible confusion. 

We say that an online algorithm returning an independent set ALG is a-competitive if for any 
selection of nonnegative weights W given by the adversary, aE^_o-['ii'(ALG)] > ¥,f^[w{OPT{a))]. 

As a particular case, consider the partial and full information models in which elements receive 
their weights independently from a fixed distribution V over the nonnegative reals. Since it is 
possible for the expected weight of the optimum and the expected weight of the set returned by the 
algorithm to be both infinite (for instance, if V has infinite mean), the concept of competitiveness 
for this scenario has to be slightly modified. We say that an algorithm returning an independent set 
ALG is a-competitive in the partial or full information models if Ex),7r[au)(ALG) — u;(OPT)] > 0. 

We claim that any algorithm that is a-competitive in the random assignment scenario is also 
a-competitive in both the full and the partial information setting. To see this, consider an a- 
competitive algorithm A under the former model, and apply to one of the latter. Note that the 
distribution of the maximum independent set is the same as the one obtained by first choosing a 
set I^ of n sample values from V, and then assigning these values to the elements using a uniform 
random permutation a. For any realization of values of W according to T>, algorithm A returns a 



set ALG such that E,r,o-[c^^i'(ALG) — w{OPT{a))] > 0. By taking expectation over the reahzations 
of W we prove the claim. 

3 Uniformly dense matroids 

Define the density 'y{M) of a loopless matroiiXA -M = {E,I) with rank function r, to be the 
maximum over ah non-empty sets \X\, of the quantity \X\/r{X). The matroid Ai is uniformly 
dense if 7(A^) is attained by the entire ground set; that is, if ^Wr < :p7^, for every non-empty 
X C E. Examples of uniformly dense matroids include uniform matroids, the graphic matroid of 
a complete graph, and all projective geometries PG{r — 1,F). The following property of uniformly 
dense matroids is important for our analysis. 

Lemma 3.1. Let (xi, . . . ,Xj) be a sequence of different elements of a uniformly dense matroid 
chosen uniformly at random. The probability that element Xj is selected by the greedy procedure on 
that sequence is at least 1 — (j — l)/r, where r is the rank of the matroid. 

Proof. An element is selected by the greedy procedure only if it is outside the span of the previous 
elements. Denote by Ai = {xi, . . . , Xj} to the set of the first i elements of the sequence, and let n 
be the number of elements of the matroid, then: 

T^ r . 1 n n — span(j4,_i) n — r(j4,_i)n/r n — (j — l)n/r , ,. ,, , 

Pr a;,- is selected = ^.^ ^.^ > \^. [' > ^-^ , . \' > 1 - (j - l)/r, 

^ J n - (j - 1) - n - - 1) - n - (j - 1) - 

where the first inequality holds since the matroid is uniformly dense and the second holds because 
the rank of a set is always at most its cardinality. D 

3.1 A 2e-competitive algorithm for uniformly dense matroids 

Consider the following algorithm for uniform matroids of rank r due to Babaioff et al. [3]: Maintain 
the set T consisting on the r heaviest elements seen so far (initialize this set with r dummy elements 
that are considered to be lighter than everyone else). Observe the first m = pn elements without 
adding them to the output; we refer to this set as the sample. An element arriving later will be 
added to the output only if at the moment it is seen, it enters T and the element that leaves T is 
either in the sample or a dummy element. Babaioff et al. have shown that this algorithm returns 
a set of at most r elements and that by setting p to be 1/e, every element of the optimum is in 
the output of the algorithm with probability at least 1/e, making this algorithm e-competitive for 
uniform matroids even under zero information. 

A slight modification of this algorithm is at least 2e-competitive for uniformly dense matroids in 
the random assignment model. The full procedure is depicted in Algorithm [TJ The only differences 
with respect to the algorithm above are that (i) The number of elements sampled is given by 
a binomial distribution Bin(n,p) and (ii) Before adding an element to the output, we test if its 
addition maintains independence in the matroid. 



^A circuit is a minimal non-independent set of a matroid. A loop is an element e such that {e} is a circuit. A 
loopless matroid is a matroid having all singletons independent. 



Algorithm 1 for uniformly dense matroids of n elements and rank r under random assignment. 

1: Maintain a set T containing the heaviest r elements seen so far at every moment (initialize T 
with r dummy elements that are supposed to be lighter than every element in the stream). 

2: ALG^0. 

3: Choose m from the binomial distribution Bin(n,p). 

4: Observe the first m elements and denote this set as the sample. 

5: for each element e arriving after the first m elements do 

6: if e enters T and the element leaving T is in the sample or is a dummy element then 

7: Check if ALG U {e} is independent. If so, add e to ALG. 

8: end if 

9: end for 
10: Return the set ALG. 

Theorem 3.2. Let ALG be the set returned by Algorithm {1\ when applied to a uniformly dense 
matroid A4 of rank r. Then 

T 

E,,,[it;(ALG(a))] > {-p^\uv)Y,Wi > ( V lnp)E,[it;(OPT^((j))]. 

i=l 

In particular, by setting p = e~^l'^ , we conclude that Algorithmic is 2e- competitive for uniformly 
dense matroids in the random assignment model. 

Proof. Consider the following offline simulation algorithm. In the first part of the simulation, each 
weight Wi in the adversarial list W selects an arrival time tj in (0, 1) uniformly and independently. 
The algorithm keeps a set T containing the top r weights seen at every moment (initially containing 
r dummy weights of negative value) and processes the weights as they arrive, sampling the ones 
arriving before time p. When a weight arriving after time p is processed, the algorithm marks it 
as a candidate if, first, that weight enters T, and second, the one leaving T is either in the sample 
or a dummy weight. In the second part of the simulation, the algorithm assigns to each candidate 
weight a different element of the matroid uniformly at random. Then, it runs the greedy procedure 
on the sequence of candidates in the order they appeared and returns its answer. Using that the 
cardinality of the sampled set has binomial distribution with parameters n and p, it is not hard to 
check that the set of elements and weights returned by this simulation has the same distribution 
as the one returned by Algorithm [T] For this reason, we focus on the simulation. 

We estimate the probability that each one of the top r weights appears in the output set. Focus 
on one such weight Wi, i < r, and let ^ be a nonnegative integer strictly smaller than r. Define Si as 
the event that exactly £ of the top r weights excluding Wi are sampled. Given that event £i occurs, 
the corresponding i high weights in the sample enter T as soon as they arrive and never leave it. 
Since every candidate pushes out either a dummy or a sampled weight of T at the moment it is 
marked, the previous implies that the number of candidates marked by the simulation algorithm is 
at most (in fact, exactly) r — £. Event Si occurs with probability (''^ )p^(l —pY~'^~^. 

Claim 1. For i < r — l,i < r and ti > p, VT[wi is marked as a candidate \£i,ti) > p/ti. 

Proof. Since Wi is one of the top r weights, it enters set T at the time it is considered. Thus, it 
will be marked as a candidate if and only if the weight leaving T at that time is either a dummy 
or a sampled weight. 



Let A(ti) = {wj : tj < ti} be the set of weights seen before Wi arrives. If this set has less than r 
elements then the element leaving T at ti will be a dummy weight. Consider the case where A{ti) 
has cardinality at least r and let Wj be the top r-th element of this set. Since Wj is not one of the 
top r elements in the full adversarial list, its arrival time tj is independent of £i. Therefore, 

PT{wi is marked as a candidate | £i,ti) 

= l-Fr{\A{ti)\ <r \ £i,ti) + Fr{tj < p \ tj < t^) Pr(|^(ti)| >r \ £i,ti) 

= l-Pvi\Aiti)\<r\£i,ti) + ^PT{\A{ti)\>r \£i,ti)>^. D 

The elements of the matroid assigned to the candidate weights form a random set. Conditioned 
on £1 and on Wi being a candidate. Lemma 13.11 implies that no matter what position Wi takes in 
the list of at most r — i candidates, the probability that it gets added to the output is at least 
1 — (r — i)/r = {£ + ^)/r; therefore, the probability that Wi appears in the output is at least 

Theorem 13.21 follows easily from here. D 

We remark that Algorithm [1] does not need to learn the weights of the elements: the algorithm 
can proceed by only learning the relative order of the weights seen so far. Also, we note that this 
algorithm is not constant competitive in the zero information model. In fact, if we had such an 
algorithm A for uniformly dense matroids under zero information we could obtain one for general 
matroids by using that every matroid Al is a restriction of a uniformly dense matroid M' [17j . 
The algorithm for Ai would virtually complete the matroid Ai' by adding a dummy set of zero 
weight elements and run algorithm A on ^A' , simulating the augmented input in such a way that 
the dummy elements arrive uniformly at random similarly to the real ones. 

4 Principal sequence and general matroids. 

In this section we need the concept of principal sequence of a matroid. This notion is related to the 
theory of principal partition of graphs, matroids and submodular systems, which has applications 
to connectivity and reliability of networks and to resource allocation problems. The theory of 
principal partition is extensively analyzed in a monograph by Narayanan [20] and in a recent survey 
of Fujishige [14]. The definition of principal sequence of a matroid we present was introduced by 
several authors under different names (See, e.g., [T9 | [23 l [21] ) . 

Theorem 4.1 (Principal Sequence). Let M. = {E,I) be a loopless matroid with rank function r. 
There is a sequence of sets % = Fq C- Fi ^ ■ ■ ■ C^ F^ = E and a sequence of values cxo > Ai > A2 > 
• • • > Afc > 1 satisfying: 

1. The values Ai, . . . , A^ are the only ones for which the submodular set function fx. := Xir{X) — 
\X\ admits more than one minimizer. 

2. For every 1 < i < k, the unique minimal and unique maximal minimizer s of the function fx^ 
are -Fj_i and Fi respectively. 

The sequence {Fi)^^Q is called the principal sequence of Ai and (Aj)f^]^ is the associated sequence 
of critical values. 



From this definition, it is not hard to obtain the fohowing lemma (it also follows from [14^ 
Theorem 3.11] or [8]). 

Lemma 4.2. Let M. = {E,1) be a loopless matroid with principal sequence (-Fi)f=o ^''^^ critical 
values (Aj)jL]^. Then, for every 1 < i < k, the matroid A4i = {M/Fi^i) \. . , obtained by 

\ F-\ P- I 

contracting Fj_i and restricting to Fi \ Fj_i is uniformly dense, with density ^ (f.\f^ ) = ^i- 
These matroids are known as the principal minors of M. 

The principal sequence of M. can be constructed by iteratively finding the maximal densest set 
Fi of the current matroid, adding it to the sequence and contracting it on the matroid, until all the 
elements have been contracted. Polynomial time algorithms to compute the principal sequence of 
a given matroid can be found in the literature (see, e.g. j21) or [2^ chapters 10,11]). 

We use Lemma 14.21 to design an algorithm for the matroid secretary problem under random 
assignment in a general (not necessarily loopless) matroid M = {E,I). Let {Aii)i^i be the sequence 
of principal minors of the loopless matroid obtained by deleting the set Eq of loops from A4. For 
every i, let Ei = Fi\ i*i_i and r^ denote the ground set and rank of M.i respectively. Note that the 
family {Eq, . . . , E'fc} is a partition of the ground set E. Define matroids M' and V with ground set 
E as follows: 

1{M') = \\Jli: ha l{Mi) \ , 1{V) = \\J h : h <^ E^, \Ii\ < r^ i . 

Since any independent set in M.' is, by definition of each TWj, also independent in 7W, Algo- 
rithm [2l described below returns an independent set of M. 

Algorithm 2 for General Matroids of n elements and rank r under random assignment. 

1: Compute the sequence of principal minors (A^j)JL| of the matroid obtained by removing the 

loops of M. 
2: Run Algorithm [T] in parallel on each Mi and return the union of the answers. 

Theorem 4.3. Algorithmic is 2e/(l — 1/e) = 2e^ j(e — 1) -competitive for general matroids in the 
random assignment model. 

To prove Theorem 14. 3( we compare the weight of the set ALG returned by Algorithm [2] with 
the optimum of the partition matroid V defined above. Since both A4' and V are disjoint union 
of uniformly dense and uniform matroids over the same ground set and having the same rank, 
we expect them to behave similarly. Observe that the random permutation o" : [n] — ?> [n] that is 
used to assign the weights of the adversary to the elements of the matroid can be viewed as the 
composition of a random partition of [n] into blocks of size (IE'jDJLq, and a collection of random 
permutations inside each block. Conditioned on the random partition, each block Ei receives a 
hidden list of weights which are assigned uniformly at random to the elements of the block. Since 
each Mi is uniformly dense and the elements of Ei arrive in random order. Theorem 13.21 implies 
that Algorithm [T] recovers, in expectation, at least l/(2e)-fraction of the combined weight of the 
heaviest ri elements of Ei, where the expectation is over the random permutation of that particular 



block. Noting that the union of the heaviest rj elements of each Ei is exactly the optimum of the 
partition matroid V defined above, we conclude, by removing the conditioning, that 

E^[w{0FTMi<7))] > E^[w{0FTM'i<7))] > E,,^[u;(ALG)] > E^[u;(0PTp(a))]/(2e). (1) 

To prove that Algorithm [2] is constant competitive we only need to show that the optimum of 
V is only a constant away from the optimum in A4. 

Lemma 4.4. E^[it;(OPT-p(cj))] > (1 - l/e)E^[ii;(OPTx(cr))]. 

To prove this lemma we note the following fact. For all j, let A°" = {eo-(i) , eo-(2) > • • • > eo-(j) } 
denote the (random) set of the elements that receives the top j weights. Then, for any matroid: 

n n 

E^[wiOFT{a))] = J]^, PrKo-) span(AJ_i)] = J]^,(E.[r(ylJ)] - E.[r(AJ_i)]) 

71-1 

= EMK)\wn + ^E^[r(^J)](«;, - w,+{). (2) 

i=i 

In order to prove Lemma 14.41 we only need to show the following. 
Lemma 4.5. For every l<j<n, E^[rp(AJ)] > (1 - l/e)E^[r^(^J)]. 
Proof. Since P is a partition matroid with non-trivial parts Ei, . . . ,Ek, we have: 

k k 

E,[rp(AJ)] =^E,[rp(^Jn^i)] =I]E,[min(|^Jn^i|,ri)]. 
j=i 1=1 

For each i, V\Ei is a uniform matroid with the same density Aj = \Ei\/ri as the corresponding 
uniformly dense matroid Aii. By Theorem 14.11 these densities are strictly decreasing with i. Call 
a part Ei dense if Aj > n/j and sparse otherwise. Intuitively, when a part Ei is dense we expect 
Ay" n Ei to contain at least \Ei\{j/n) > ri elements and thus we expect the rank of A'^ D Ei in the 
partition matroid to be close to rj. On the other hand, for sparse parts this quantity should be 
closer to E[|A^ n Ei\] = \Ei\{j/n). We formalize this intuition in the following claim. 

Claim 2. // Aj > n/j, then E,a[rp{A'^ n Ei)] > (1 — l/e)rj. // on the other hand, Aj < n/j, then 
E,[rp(AJ n Ei)] > (1 - l/e)\E,\{j/n). 

Proof. Focus on a part Ei and split its elements into rj pieces as evenly as possible. To do this, let 
£« = Aj — [AjJ and create rj(l — e^) pieces of size Aj — e^ = [AjJ and rjej pieces of size Ai + 1 — e^ = 
[AjJ +1. It is easy to see that both rj(l — e^) and rje are integers and that the previous construction 
is indeed a partition of Ei into tj pieces as claimed (note that if Aj = \Ei\/ri is an integer, this 
partition consists simply on tj pieces of size Aj). 



The rank of any set in V\Ei is at least as high as the number of pieces of Ei this set intersects; 
therefore, E[rp{A'j D Ei)] is at least 

B:Pieceof£;, B : Piece of £; \ ^j' / B : Piece of Ei \ £=0 



J' / B-.Vicccoi Ei \ £=0 ^ ' 

> Y. i-(i-i/^)"" 

B: Piece of Ei 

= ri(l - e,) (l - (1 - j/n)^^-'^) + nei (l - (1 - j7n)^»-^'+i) 
= n{l-{l- Jlnf^-'^ (1 - eo/n)) 



The function (1 — e ^) is increasing, thus if Aj > n/j, 

E[rp(^J n Si)] > ri{\ - e-^'^V") > ^,(1 - e"^). 
On the other hand, the function (1 — e~^)/x is decreasing, thus if A^ < n/j, 

E[rp(^J n E,)] > r,(l - e-^^-'V") = (r,Ad/n)^l^^— ^ > (|i?.|j7n)(l - e"!). D 

Since (Aj)^^^ is a decreasing sequence, there is an index i* such that Ei is dense if and only if 
I < i < i*. Recall that IJi=i -^i is equal to the set Fi* in the principal sequence of the matroid 
A4\Eq. Since every set in the principal sequence has the same rank in both A4 and V we get: 



E^[rM{A'^)] < E.[r^(AJ n F^*) + r^(AJ H {E \ F,*))] < rM{F^^) + E^[|AJ n (S \ F,.)| 

i* k k 

= E^i+ E l^.l(jyn)<5]E.[rp(ylJni?,)]/(l-l/e)=E.[rp(^J)]/(l-l/e).n 
By combining Lemma 14.41 with inequality (JTJ we conclude the proof of Theorem 14.31 

5 Algorithms for the zero information model 

We give various algorithms for different classes of matroids in the zero information model. In all 
of them we assume the matroid is loopless (in the random assignment model we can not make this 
assumption since the introduction of loops changes the distribution of the weights of the elements). 

5.1 Cographic Matroids 

In any 3-edge-connected graph G we can find three spanning trees Ti, T2 and T3, such that the 
union of their complements covers E{G) (This follows from e.g. Edmonds's Matroid Partitioning 
Theorem [H]). The sets Bi = E\Ti are bases in the cographic matroid of G. Consider the following 
algorithm for this matroid. Select i E {1,2,3} uniformly at random and accept all elements in Bi. 
Since every edge of G is selected with probability at least 1/3, this algorithm is 3-competitive. 
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We modify this algorithm to work on the cographic matroid Ai of any graph G. First, remove 
all the bridges of G since they are loops in M. Decompose the edge set of the remaining graph 
as the direct sum of 2-edge-connected components. For each component C, let C" be the graph 
obtained by contracting all but one edge in each serial class of its corresponding graphic matroido 
Each graph C is then 3-edge-connected and, as before, we can find three bases Bi, B2 and B^ of 
the cographic matroid of C" covering E{C'). The algorithm for A4 is as follows. Independently for 
each component G, select an index i € {1,2,3} uniformly at random and run the e-competitive 
algorithm of [3] on the partition matroid that accepts at most one edge of G from each parallel class 
of A4 represented in Bi (discard every element of G not represented in Bi). Since every element of 
the optimum base of A4 is the heaviest of its parallel class and each parallel class of A4 is selected 
with probability at least 1/3, we conclude the previous algorithm is 3e-competitive. 

Theorem 5.1. For any cographic matroid M., the previous algorithms is ?,e- competitive. Further- 
more, if the graph G associated to A4 is 3-edge-connected, the algorithm is ^-competitive. 

5.2 Low Density Matroids 

A generalization of the previous algorithm is the following. Given a loopless matroid M = {E,Z) 
of density 7(A1), the vector v S M^ having all its coordinates equal to 1/7(7W) is feasible in 
the matroid polytope. In particular, this vector has a decomposition as convex combination of 
independent sets of A4: v = X^/gjA/x/j which we can find in polynomial time. The algorithm 
for matroid Ad will select an independent set / G X at random, according to probabilities {\i)i^x 
and accept its elements without looking at their weights. Since every element e is selected with 
probability Yli&xi^e^i — ^e = '^hi-M), this algorithm is 7(7W)-competitive. 

If matroid Ai contains parallel elements, we could get a better competitive ratio by considering 
the simple matroid AA' = {E',I') obtained by removing all but one edge in each parallel class of 
A4. By combining the output /' of the previous algorithm applied on A4' with the e-competitive 
algorithm for the partition matroid that selects one element in each parallel class represented in /' 
(similar to what we did for cographic matroids), we obtain a e7(7V4')-competitive algorithm. 

Theorem 5.2. For any matroid Ai, the first algorithm described is j{Ai)- competitive, and the 
second algorithm is j{Ai')e-competitive. 

5.3 Matroids with small cocircuits 

For each element e of a loopless matroid Ai = {E,I), let c*(e) be the size of the smallest cocircuit 
(i.e. circuits of the dual matroid) containing it, and let c*{Ai) = maxg c*(e). Consider the algorithm 
that greedily construct an independent set of A^ selecting elements as they appear without looking 
at their weights. We claim this algorithm is c*(A^)-competitive. To see this, fix an element e a E 
and let G* be a cocircuit of minimum size containing it. If e appears before all the other elements 
of G* in the random order then it has to be selected by the algorithm. Otherwise, there would be 



^Two elements are in series in a graphic matroid if and only if they are in parallel in the cographic matroid. A 
pair of elements {e, /} are in parallel in a matroid, if the set {e, /} is a circuit. Being in series and being in parallel 
are equivalence relations, so the serial and parallel class of an element are well defined. Contracting all but one edge 
in each serial class of the graphic matroid corresponds to deleting all but one element in each parallel class of the 
cographic matroid. For the specific case of the cographic matroid of a graph, a set of elements are in parallel if each 
pair of them is a minimal edge cut of the graph. 
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a circuit C that intersects C* only in element e, which is a contradiction (See, e.g. }22- Proposition 
2.1.11]). It is not hard to prove that for every matroid M, li-M.) ^ c*{M.). This mean that this 
algorithm is no better than the one for low density matroids. However, this algorithm is much 
simpler. 

Theorem 5.3. For any matroid M., the algorithm described above is c*{M.)- competitive. 

5.4 Column-sparse linear matroids 

Let M = {V,I) be a linear matroid represented by a matrix A containing at most k non-zero 
values in each column. Consider the following algorithm: Randomly permute the rows of A and 
define for every row i, the sets Ci = {v £ V : Vi y^ 0} and Bi = Ci\ IJj^iCj, where vi denotes 
the i-th coordinate of column v in the permuted matrix. Next, run the secretary algorithm for the 
partition matroid that accepts at most one element of each Bi. We claim that any set returned by 
this algorithm is independent m M: If this was not the case there would be a circuit C inside the 
output. Let V € C he the element belonging to the set Bi of smallest index i. By definition of v, 
the elements oi C\v are not in Cj; therefore, C and Cj intersects only in v. This is a contradiction 
since Cj is in the cocircuit space of the matroid (Use, e.g. [22l Proposition 2.1.11]). 

We claim the previous algorithm is /ce-competitive. To see this, construct the bipartite graph 
G with parts the rows and columns of A, where there is an edge (j, v) if the corresponding entry 
of A is non-zero. Assign to each edge a weight equal to the one of its associated column in Ai. 
Consider the following simulation algorithm: Randomly permute the vertices in the row part of 
the graph. Delete all the edges, except the ones going from a column vertex to its lowest neighbor 
(the row having smallest index in the random permutation). Finally, run the secretary algorithm 
for the partition matroid that accepts for each row vertex, at most one edge incident to it. This 
returns a matching with the same weight as the set of elements the original algorithm returns. 

Since for every independent set of columns, the number of row vertices that this set dominates 
in G is at least its cardinality, Hall's Theorem implies that there is a matching covering each 
independent set. In particular the weight of the maximum weight matching M* in G is at least the 
one of the optimum independent set of Ai. On the other hand, M* has weight at most the one of 
the edge set {(z, v*{i)) : i G rows(A)}, where v*{i) is the maximum weight neighbor of i in G. Since 
each edge {i,v*{i)) is not deleted with probability 1/k and, given it is not deleted, the simulation 
selects it with probability 1/e, we conclude the original algorithm is fce-competitive. 

Theorem 5.4. The previous algorithm is ke- competitive for matroids representable by matrices 
having only k non-zero elements per column. 

Note that by applying this algorithm to graphic matroids, which are representable by matrices 
having only 2 ones per column, we recover the 2e competitive algorithm of Korula and Pal |16j . 

5.5 A new O(logr) competitive algorithm for matroids 

Babaioff et al. [S] present an O(logr) competitive algorithm for general matroids of rank r. This 
algorithm has many features, including the fact that it does not need to know the matroid be- 
forehand; it only needs to know the number of elements and have access to an oracle that test 
independence only on subsets of elements it has already seen. Nevertheless, this algorithm makes 
use of the actual values of the weights being revealed. We present an algorithm having the same 
features but that only uses the relative order of weights seen and not their numerical value. 
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If we are given the rank of the matroid, our algorithm is as follows. With probability 1/2, 
run the classical secretary algorithm that returns the heaviest element of the stream. Otherwise, 
observe the first m elements of the stream, where m is chosen from the binomial distribution 
Bin(n, 1/2) (as usual, denote this set of elements as the sample) and compute the optimum base 
A = {oi, . . . , flfc} (with w{ai) > • • • > w{aig)) of the sampled elements. Afterwards, select a number 
i G {1, 3, 9, ... , 3*} with t = [log3 r\ uniformly at random, run the greedy procedure on the set of 
non-sampled elements having weight at least the one of ai as they arrive and return its answer (if 
I > k, run the greedy procedure over the entire set of non-sampled elements). If we are not given 
the rank of the matroid beforehand, we select t £ {[log3fcJ, [log3 fcj -|- 1} uniformly and use this 
value in the previous algorithm. 

The optimum of the sample is similar to the optimum of the nonsampled part: For any number i 
the algorithm can choose, there is an independent set of size close to i outside the sample with every 
element heavier than a£ (with high probability); therefore, the greedy procedure recovers a weight 
of roughly iw{ai). By taking the expectation over the choices of i it is not hard to check that the ex- 
pected weight returned by the algorithm is at least r2(E[it;(yl)/log3(r)]) = r2(E[ti;(OPT)/log3(r)]). 
We give the formal proof below. 

Theorem 5.5. The algorithm described above is Oi\ogr)-competitive for any matroid of rank r. 

Proof. Assume first that the rank r of the matroid is known. Let OPT = {ei, . . . , e^} with w{ei) > 
• • • > w{er) be the maximum independent set of the matroid and A = {ai, . . . , a^} be the optimum 
of the sample, i.e., the optimum of the first Bin(n, 1/2) elements in the stream (independent of 
whether the algorithm computes A or not). Note that every element of the matroid is sampled 
independently with probability 1/2, including the elements of the optimum. Therefore, 

E[..(^)l > ^^. (3) 

To simplify our analysis, in the following we assume that for i > k, ai is a dummy element with 
w{ai) = 0. Given the number i chosen by the algorithm (if the algorithm reaches that state), the 
weight of the set returned will be at least w{a£) times the number of elements the greedy procedure 
selects; therefore, E[tf (ALG)] is at least 

— ^ + -^ n ^ V IE \w{af) ■ lALGI \ £ = 3^ was selected] . 

2e 2(l+Llog3rJ) A. L ^ ^^ I I I J 

Let H{ai) be the collection of non-sampled elements that are heavier than a^. If the algorithm 
chooses the number i, it will then execute the greedy procedure on H^ai) and return a set of 
cardinality equal to the rank of H{ae). Note that for every i, w{ei) > w{ai); therefore, the rank 
of H{a() is at least the number of nonsampled elements in {ei, . . . , e^}. By Chernoff bound, the 
probability that this last quantity is smaller than £/4 is at most exp(— £/8). 

In particular, if £ > 9, E[u'(a^) • |ALG[ I t] > E[u>(a^)](l - exp(-^/8))^/4 > E[w{ae)]£/6. There- 
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fore, 

( -. -, UogsH 

L ^ '^- 2e 12(1+ [logs rj) ^ ^ ^ ^"^ 



j=2 



> E 



> 



w[ai,...,as) , 1 Y^ / n 
We + 24(l+Llog3rJ) ^ Mas^V • • ,a3.+^-i) 

E[w{A)] 



J=2 



16e(l+Llog3rJ)- 
Using inequality ([3]), we get 

u;(OPT) 



E[u'(ALG)] > 



32e(l + Llog3rJ)' 



which implies the algorithm is 0(logr)-competitive. 

Suppose now that the rank r is unknown. If r is small, say r < 12, then with probability 
l/(2e) the algorithm will run the standard secretary algorithm and return the top element of the 
matroid. This element has weight at least 1/12 fraction of the optimum; therefore the algorithm is 
24e-competitive for this case. 

For the case where r > 12 we use a different analysis. The random variable k denoting the 
rank of the sampled set could be strictly smaller than r. However, the probability that k < r/3 is 
small. Indeed, for that event to happen we require that at most 1/3 of the elements of OPT are in 
the sample. By Chernoff bound, this happens with probability exp(— r/18) < exp(— 13/18) < 1/2. 
Noting that r/3 < k < r implies that [log3rJ E {[log3/cJ, [log3 A;J + 1}, we deduce that with 
probability at least 1/4 our algorithm guesses t = [log3 r\ right; therefore, the competitive ratio of 
this algorithm is at most 4 times worse than the one that knows the rank beforehand. D 
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